Abstract. We generalize a family of variation norm estimates
(Theorem 1.1) of Lépingle with endpoint estimates of Bourgain
and Pisier-Xu to a family of variational estimates for paraproducts
(Theorem 1.2), both in the discrete and the continuous setting.
This expands on work of Friz and Victoir, our focus being on the
continuous case and an expanded range of variation exponents.



1. Introduction and Main Theorem 

In this paper, a band limited function on $\mathbb{R}$ with band width $N$ has
Fourier transform supported in $\{2^{-N} \le |\xi| \le 2^{N}\}$. A collection of
functions $(f_i)_{i\in\mathbb{Z}}$ such that the dilates $f_i(2^{-i}x)$ are band limited
with uniform band width is called a continuous Littlewood-Paley family. The
collection is called a discrete Littlewood-Paley family if $f_i$ is spanned by the
Haar functions associated to dyadic intervals of length $2^{1-i}$; it is then
essentially a martingale. The following theorem is known:

Theorem 1.1. Given $1 < p < \infty$, $1 < r \le 2$, $r \le t < \infty$, and $N > 0$,
there is a constant $C$ such that for any collection $(N_k)_{k\in\mathbb{N}_0}$ of
measurable functions on $\mathbb{R}$ and any Littlewood Paley data $f_i$ (continuous with
bandwidth $N$ or discrete) the following holds:
If $r < 2$ or $t > r$, then

(1)  $$\Big\| \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} f_i \Big|^t \Big)^{1/t} \Big\|_p \le C\, \|(f_i)\|_{p,r} .$$

If $r = 2$ and $t = r$, then for every $\lambda > 0$

(2)  $$\Big\| \lambda \Big( \sharp\Big\{k : \Big| \sum_{N_{k-1} < i \le N_k} f_i \Big| > \lambda \Big\} \Big)^{1/t} \Big\|_p \le C\, \|(f_i)\|_{p,r} .$$

Here we have set

(3)  $$\|(f_i)\|_{p,r} := \Big\| \Big( \sum_i |f_i|^r \Big)^{1/r} \Big\|_p .$$

In the discrete case, these inequalities are special cases of known
inequalities for martingales: (1) is Lépingle's inequality [6] if $t > r$ and
Pisier and Xu's inequality [11] if $t = r$. Inequality (2) was used by
Bourgain [1] to give a proof of Lépingle's inequality. Inequality (2) for
$r < 2$ or $t > r$ follows from (1) by Chebysheff's inequality.
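For a fixed point $x$, the left-hand sides of (1) and (2) depend only on the finite or infinite sequence of numbers $f_i(x)$. The following is a minimal sketch, assuming a finite real sequence indexed $i = 1,\dots,n$, of how the supremum over all choices $N_0 < N_1 < \dots$ in (1) can be computed by dynamic programming over prefix sums. The function name `variation_norm` is hypothetical and not taken from the paper.

```python
from typing import Sequence


def variation_norm(a: Sequence[float], t: float) -> float:
    """Sup over 0 <= N_0 < N_1 < ... <= n of
    (sum_k |sum_{N_{k-1} < i <= N_k} a_i|^t)^(1/t), for a finite sequence a_1..a_n."""
    n = len(a)
    # prefix[m] = a_1 + ... + a_m, so a block sum equals prefix[m] - prefix[l].
    prefix = [0.0]
    for x in a:
        prefix.append(prefix[-1] + x)
    # best[m] = largest value of sum_k |block_k|^t over chains whose last point is m.
    # best[l] >= 0 always, which also covers chains that start at l.
    best = [0.0] * (n + 1)
    for m in range(1, n + 1):
        best[m] = max(best[l] + abs(prefix[m] - prefix[l]) ** t for l in range(m))
    return max(best) ** (1.0 / t)
```

For $t = 1$ the value is $\sum_i |a_i|$, and for large $t$ it approaches the largest oscillation of the partial sums, which matches the role of $t$ as interpolating between an absolutely summable and a maximal quantity.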

For $r = 2$, the continuous case of Theorem 1.1 has been observed
by several authors ([?, ?, ?]) in the following setting. Define for some
fixed function $f$

(4)  $$f_i := \phi_i * f, \qquad \phi_i(x) := 2^i \phi(2^i x) ,$$

where $\phi$ is some band limited test function. Then (3) for $r = 2$ is the
norm of a Littlewood Paley square function and dominated by $\|f\|_p$,
and the domination is an equivalence if $\phi$ is chosen appropriately. We
present a proof of the continuous case $r < 2$ of Theorem 1.1 in Section 2
as model for the unfortunately somewhat technical Section 6.
We prove the following bilinear variant of Theorem 1.1:

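Theorem 1.2. Given $1 < p, q < \infty$, $1 < r, s \le 2$, $1/t \le 1/r + 1/s$, $N > 0$,
there is a constant $C$ such that for any collection $(N_k)$ of measurable
functions and any Littlewood Paley data $f_i$ and $g_j$ (each continuous
with bandwidth $N$ or discrete) the following holds:
If $\max(r,s) < 2$ or $t > rs/(r+s)$, then

(5)  $$\Big\| \Big( \sum_k \Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big|^t \Big)^{1/t} \Big\|_{pq/(p+q)} \le C\, \|(f_i)\|_{p,r}\, \|(g_j)\|_{q,s} .$$

If $\max(r,s) = 2$ and $t = rs/(r+s)$, then

(6)  $$\Big\| \lambda \Big( \sharp\Big\{k : \Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big| > \lambda \Big\} \Big)^{1/t} \Big\|_{pq/(p+q)} \le C\, \|(f_i)\|_{p,r}\, \|(g_j)\|_{q,s} .$$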

For $r = s = 2$ and $p = q$, $f_i = g_i$, the discrete case of (5) is a
special case of a martingale inequality of Friz and Victoir [2]. All
other instances of this theorem appear to be new. Inequality (6) for
$\max(r,s) < 2$ or $t > rs/(r+s)$ follows from (5) by Chebysheff. Note that
Theorem 1.2 has a continuous, a discrete, and a mixed continuous and
discrete case, though the latter is maybe less natural.

Theorems 1.1 and 1.2 have alternative formulations, where $f_i$ and $g_j$
are not required to be band limited. Instead, in the continuous case
we replace $f_i$ and $g_j$ on the left-hand-side by $\phi_i * f_i$ and $\phi_j * g_j$ with
$\phi_i$ as in (4), while in the discrete case we use Haar projections on the
left-hand-side. In particular, we have

Theorem 1.3. Let $\phi_i$ be as in (4). Given $1 < p, q < \infty$, $1 < r, s \le 2$,
$1/t \le 1/r + 1/s$, there is a constant $C$ such that for any collection
$(N_k)$ of measurable functions and any sequences of functions $f = (f_i)$,
$g = (g_j)$ with the normalization

(7)  $$\|(f_i)\|_{p,r} = \|(g_j)\|_{q,s} = 1$$

the following holds:

If $\max(r,s) < 2$ or $t > rs/(r+s)$, then

(8)  $$\Big\| \Big( \sum_k \Big| \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j) \Big|^t \Big)^{1/t} \Big\|_{pq/(p+q)} \le C .$$

If $\max(r,s) = 2$ and $t = rs/(r+s)$, then

(9)  $$\Big\| \lambda \Big( \sharp\Big\{k : \Big| \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j) \Big| > \lambda \Big\} \Big)^{1/t} \Big\|_{pq/(p+q)} \le C .$$

Theorem 1.2 implies Theorem 1.3 by an application of the former
to the Littlewood Paley family $(\phi_i * f_i)$ and by the Fefferman-Stein
inequality [12] for the Hardy Littlewood maximal operator $M$:

$$\|(\phi_i * f_i)\|_{p,r} \le C\, \|(M f_i)\|_{p,r} \le C\, \|(f_i)\|_{p,r} .$$

Conversely, the continuous case of Theorem 1.2 follows by specializing
Theorem 1.3 to a continuous Littlewood Paley family $(f_i)$ and
choosing $\phi$ with large band width such that $\phi_i * f_i = f_i$. We will use
both formulations: Theorem 1.2 has simpler notation while Theorem
1.3 works better with truncations of $f$ and $g$ and interpolation.
Another alternative formulation arises from setting

$$f_i := \sum_{|I| = 2^{1-i}} a_I \phi_I$$

where $a_I$ are coefficients and for each dyadic interval $I$ the function $\phi_I$
is an $L^\infty$ normalized band limited bump function adapted to $I$ in the
sense of [12]. In particular it satisfies for some $\epsilon > 0$

$$|\phi_I(x)| \le C\, (1 + |x - c(I)|/|I|)^{-1-\epsilon} ,$$

which allows to estimate (3) by

$$C\, \Big\| \Big( M \sum_{|I| = 2^{1-i}} a_I 1_I \Big) \Big\|_{p,r} \le C\, \Big\| \Big( \sum_{x \in I} |a_I|^r \Big)^{1/r} \Big\|_{L^p(x)} .$$

The $L^\infty$ normalization of $\phi_I$ makes $a_I$ have the same normalization as
the values of the corresponding $f_i$.
The restriction $i < j$ in Theorem 1.2 is analogous to the paraproduct
[?] of two functions $f$ and $g$:

$$P(f,g) = \sum_{i<j} (\phi_i * f)(\phi_j * g) .$$

The endpoint of Theorem 1.2 for $t = \infty$ is a classical maximal
paraproduct inequality, for example in the convolutional case we have

Proposition 1.1. For $1 < p, q < \infty$,

(10)  $$\Big\| \sup_{N_0, N_1} \Big| \sum_{N_0 < i < j \le N_1} (\phi_i * f)(\phi_j * g) \Big| \Big\|_{pq/(p+q)} \le C_{p,q}\, \|f\|_p\, \|g\|_q .$$

Inequality (1) has a trivial endpoint for $r = t = 1$ by the triangle
inequality. Similarly, the endpoint of (5) for $r = 1$ or $s = 1$ is easy
and elaborated in detail in Proposition 4.1. This endpoint is proved
by reduction to Theorem 1.1 and used as interpolation endpoint to
prove part of Theorem 1.2. Note that while classical variation norms
as in (1) are somewhat pointless for $t < 1$, because then the choice of
consecutive points $N_{k+1} = N_k + 1$ is extremal, the bilinear variational
expression in (5) is meaningful for $t > 1/2$.
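The remark on consecutive points can be made concrete at a fixed point $x$. The sketch below, assuming finite real sequences of equal length indexed from $1$ and the index convention $N_{k-1} < i < j \le N_k$, evaluates one block of the paraproduct and the bilinear variation by the same dynamic programming idea as for classical variation. The names `block_paraproduct` and `bilinear_variation` are hypothetical.

```python
from typing import Sequence


def block_paraproduct(f: Sequence[float], g: Sequence[float], l: int, m: int) -> float:
    """sum over l < i < j <= m of f_i * g_j, with 1-based indices i, j."""
    total, partial_f = 0.0, 0.0
    for j in range(l + 1, m + 1):
        total += partial_f * g[j - 1]   # partial_f = sum_{l < i < j} f_i
        partial_f += f[j - 1]
    return total


def bilinear_variation(f: Sequence[float], g: Sequence[float], t: float) -> float:
    """Sup over chains N_0 < N_1 < ... of (sum_k |block_k|^t)^(1/t)."""
    n = len(f)
    best = [0.0] * (n + 1)
    for m in range(1, n + 1):
        best[m] = max(
            best[l] + abs(block_paraproduct(f, g, l, m)) ** t for l in range(m)
        )
    return max(best) ** (1.0 / t)
```

A block between consecutive points contains no pair $i < j$ and therefore contributes zero, so the finest partition is no longer extremal and exponents $t < 1$ carry information in the bilinear setting.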

We make two simple observations on Theorem 1.2. The variant of
(5) without the paraproduct restriction $i < j$ follows from Theorem 1.1
by Hölder's inequality. The high exponent case $t > 2$ of Theorem 1.2
has a simple proof that we sketch in the discrete case. Let $\Delta_m$ denote
the projection onto the Haar functions associated to dyadic intervals
of length $2^{1-m}$. Then

$$\Delta_m\Big( \sum_{i<j} f_i g_j \Big) = \sum_{i<m} f_i g_m ,$$

$$\sum_{N_{k-1} < i < j \le N_k} f_i g_j = \sum_{N_{k-1} < m \le N_k} \Delta_m\Big( \sum_{i<j} f_i g_j \Big) - \Big( \sum_{i \le N_{k-1}} f_i \Big) \Big( \sum_{N_{k-1} < j \le N_k} g_j \Big) .$$

The first term on the right hand side can be estimated by (1) and
(10), while the second term can be estimated by the Hardy Littlewood
maximal theorem and (1) applied to the terms involving $f$ and
$g$ respectively.
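The splitting of a paraproduct block into a sum of Haar levels minus a product of two partial sums is purely algebraic at each point $x$. The following sketch checks it on finite integer sequences, assuming the sequences start at index $1$ (so that $\sum_{i<m} f_i$ is a finite sum) and writing `level(m)` for the value of $\sum_{i<m} f_i g_m$. All names are hypothetical.

```python
from typing import Sequence


def block_sum(f: Sequence[int], g: Sequence[int], a: int, b: int) -> int:
    """sum over a < i < j <= b of f_i * g_j (1-based indices)."""
    return sum(
        f[i - 1] * g[j - 1]
        for i in range(a + 1, b + 1)
        for j in range(i + 1, b + 1)
    )


def split_sum(f: Sequence[int], g: Sequence[int], a: int, b: int) -> int:
    """sum_{a < m <= b} (sum_{i < m} f_i) g_m  -  (sum_{i <= a} f_i)(sum_{a < j <= b} g_j)."""
    def level(m: int) -> int:
        return sum(f[: m - 1]) * g[m - 1]   # value of sum_{i < m} f_i g_m

    head = sum(f[:a])        # sum_{i <= a} f_i
    tail = sum(g[a:b])       # sum_{a < j <= b} g_j
    return sum(level(m) for m in range(a + 1, b + 1)) - head * tail
```

Integer inputs are used so that the two sides can be compared exactly rather than up to rounding.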

We became interested in variational estimates for paraproducts while
studying $L^p$ estimates for a variational expression of the form

(11)  $$\Big( \sum_k \Big| \iint_{N_{k-1} < \xi < \eta < N_k} \hat f(\xi)\, \hat g(\eta)\, e^{2\pi i x(\xi + \eta)}\, d\xi\, d\eta \Big|^r \Big)^{1/r} .$$

This can be viewed as a bilinear analogue of the variation norm Carleson
operator studied in [10] or as variational variant of the bi-est
operator introduced in [9]. It requires time-frequency analysis to be
understood, and Theorem 1.2 provides the related lacunary estimates.

The general type of bilinear variational estimate as in Theorem 1.2
and (11) is motivated by Terry Lyons' [7] theory of ODE with rough
driving terms. This theory bootstraps Theorem 1.2 to the diagonal case
$p_1 = \dots = p_M$ of the following multilinear generalization of Theorem 1.2:

Proposition 1.2. For $2/M < r < \infty$, $1 < p_m < \infty$, and
$f_m \in L^{p_m}(\mathbb{R}^n)$ for $m = 1, \dots, M$, we have with $1/p = \sum_m 1/p_m$:

$$\Big\| \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i_1 < \dots < i_M \le N_k} \prod_m (\phi_{i_m} * f_m) \Big|^r \Big)^{1/r} \Big\|_p \le C \prod_m \|f_m\|_{p_m} .$$

The non-diagonal case is likely to follow from a multilinear general- 
ization of arguments as in this paper, though the exact elaboration of 
the details is beyond the scope of this paper. 

Further consequences in the case of martingales are discussed in [2]. 
The authors would like to thank Terry Lyons for pointing out Reference 
[2] after circulation of an earlier draft of this paper. 

2. Proof of Pisier and Xu's inequality, continuous case 

Recall that Pisier and Xu's inequality is the case $t = r < 2$ of
inequality (1). In the diagonal case, $p = t$, inequalities (1) and (2)
can be written as strong and weak type $L^p$ estimates (see [12] for this
terminology) for mappings from functions on the measure space $\mathbb{R} \times \mathbb{Z}$
to functions on the measure space $\mathbb{R} \times \mathbb{N}_0$:

(12)  $$\Big\| \sum_{N_{k-1}(x) < i \le N_k(x)} \phi_i * f_i(x) \Big\|_{L^p(x,k)} \le C\, \|f_i(x)\|_{L^p(x,i)} ,$$

(13)  $$\Big| \Big\{ (x,k) : \Big| \sum_{N_{k-1}(x) < i \le N_k(x)} \phi_i * f_i(x) \Big| > \lambda \Big\} \Big| \le C \lambda^{-2}\, \|f_i(x)\|^2_{L^2(x,i)} .$$

Here we use the analogue setup as in Theorem 1.3. For a proof of
(13) we refer to [3]. The endpoint of (12) for $p = 1$ follows from the
triangle inequality. Hence (12) follows by Marcinkiewicz interpolation
from (13), which completes the discussion of the diagonal case.

Starting from this diagonal case, we shall lower and raise the exponent
$p$ by Calderón-Zygmund decomposition and sharp function techniques
respectively. For the sub-diagonal case, $p < t$, fix $r$ and employ
Marcinkiewicz interpolation for $\ell^r$-valued functions. The interpolation
endpoints are the diagonal case $p = r$ and the weak type bound

(14)  $$\Big| \Big\{ x : \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} \phi_i * f_i(x) \Big|^r \Big)^{1/r} > \lambda \Big\} \Big| \le \frac{C}{\lambda}\, \Big\| \Big( \sum_i |f_i|^r \Big)^{1/r} \Big\|_1$$
at $L^1$. The latter follows via a Calderón-Zygmund decomposition of
the vector function $f = (f_i)$ at level $\lambda$. Let

$$E = \Big\{ x : M\Big( \Big( \sum_i |f_i|^r \Big)^{1/r} \Big)(x) > \lambda \Big\}$$

and let $\mathbf{I}$ be the collection of maximal dyadic intervals contained in the
set $E$. Split $f$ as

$$f = g + b = g + \sum_{I \in \mathbf{I}} b_I ,$$

where on each interval $I$ the function $g$ is constant equal to the average
of $f$ on $I$, and each $b_I$ is $f - g$ restricted to $I$. It suffices to prove (14)
separately with $f$ replaced by $g$ and $b$ on the left-hand-side. But

$$\|\phi_i * b_{I,i}\|_{L^1((3I)^c)} \le C \lambda |I| \min\big( 2^i |I| ,\, (2^i |I|)^{-\epsilon} \big)$$

by smoothness ($|I| < 2^{-i}$) and decay ($|I| > 2^{-i}$) estimates for $\phi_i$.
Hence, by embedding $\ell^1$ into $\ell^r$,

$$\Big\| \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} \phi_i * \sum_I b_{I,i} \Big|^r \Big)^{1/r} \Big\|_{L^1((\bigcup 3I)^c)} \le \sum_I \sum_i \|\phi_i * b_{I,i}\|_{L^1((3I)^c)} \le C \lambda |E| \le C\, \|f\|_{1,r} .$$

Then (14) for $b$ follows from Chebysheff's inequality. On the other
hand,

$$\|g\|^r_{r,r} \le C \lambda^{r-1} \|g\|_{1,r} \le C \lambda^{r-1} \|f\|_{1,r}$$

and (14) for $g$ follows from the known diagonal estimate and Chebysheff's
inequality. This completes the proof of the sub-diagonal case.
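The decomposition $f = g + b$ can be illustrated in a simplified discrete model. The sketch below makes several assumptions that are not in the text: $f$ is a scalar function sampled on $2^n$ equal cells (rather than an $\ell^r$-valued function on the line), the maximal function is the dyadic one, and the bad intervals are therefore the maximal dyadic intervals on which the average of $|f|$ exceeds $\lambda$. The name `cz_decompose` is hypothetical.

```python
from typing import List, Sequence, Tuple


def cz_decompose(
    f: Sequence[float], lam: float
) -> Tuple[List[float], List[float], List[Tuple[int, int]]]:
    """Dyadic Calderon-Zygmund decomposition of samples f (length a power of two).

    Returns (g, b, bad) with f = g + b, where bad lists the maximal dyadic
    index ranges [lo, hi) on which the average of |f| exceeds lam."""
    bad: List[Tuple[int, int]] = []

    def visit(lo: int, hi: int) -> None:
        # Top-down search: the first interval found with large average is maximal.
        if sum(abs(v) for v in f[lo:hi]) / (hi - lo) > lam:
            bad.append((lo, hi))
        elif hi - lo > 1:
            mid = (lo + hi) // 2
            visit(lo, mid)
            visit(mid, hi)

    visit(0, len(f))
    g = list(f)
    for lo, hi in bad:
        avg = sum(f[lo:hi]) / (hi - lo)
        for i in range(lo, hi):
            g[i] = avg          # g is the average of f on each bad interval
    b = [fi - gi for fi, gi in zip(f, g)]   # b has mean zero on each bad interval
    return g, b, bad
```

In this model the good part is bounded by $2\lambda$ on every bad interval whose dyadic parent is not bad, and the bad part has mean zero on each bad interval, which are the two properties used in the argument above.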
For the super-diagonal case, $p > r$, consider the sharp function

(15)  $$(Tf)^\sharp(x) = \sup_{x \in I} \inf_c \frac{1}{|I|} \int_I |Tf(y) - c|\, dy ,$$

where

$$Tf(x) = \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} \phi_i * f_i(x) \Big|^r \Big)^{1/r} .$$

Here the supremum is over all sequences $(N_k)$. The $L^p$ norms of $Tf$
and $(Tf)^\sharp$ are comparable [12], hence it suffices to estimate pointwise

(16)  $$(Tf)^\sharp(x) \le C\, M_r\Big( \Big( \sum_i |f_i|^r \Big)^{1/r} \Big)(x) ,$$

where

$$M_r h(x) := \Big( \sup_{x \in I} \frac{1}{|I|} \int_I |h(y)|^r\, dy \Big)^{1/r} .$$



Fix an interval $I$ and let $\tilde f_i$ be a constant function on $\mathbb{R}$ whose value
equals the average of $\phi_i * f_i$ on $I$ if $2^{-i} > |I|$ and equals zero if $2^{-i} \le |I|$.
Instead of taking infimum in $c$, we evaluate the definition of the sharp
function $(Tf)^\sharp$ with

$$c = \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} \tilde f_i \Big|^r \Big)^{1/r} .$$

By the general norm inequality $|\, \|a\| - \|b\| \,| \le \|a - b\|$ we have

$$\Big| \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} \phi_i * f_i(y) \Big|^r \Big)^{1/r} - c \Big| \le \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} \big( \phi_i * f_i(y) - \tilde f_i(y) \big) \Big|^r \Big)^{1/r} .$$
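We write

$$\phi_i * f_i - \tilde f_i = h_i^{[1]} + h_i^{[2]} + h_i^{[3]}$$

where

$$h_i^{[2]} = h_i^{[3]} = 0$$

for $2^{-i} > |I|$, while for $2^{-i} \le |I|$ we have $h_i^{[1]} = 0$ and

$$h_i^{[2]} = \phi_i * (f_i 1_{3I}) , \qquad h_i^{[3]} = \phi_i * (f_i 1_{(3I)^c}) .$$

Estimating the three summands separately, we have for $h^{[1]}$, using
embedding of $\ell^1$ into $\ell^r$ and smoothness of $\phi_i * f_i$,

$$\frac{1}{|I|} \int_I \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} h_i^{[1]}(y) \Big|^r \Big)^{1/r} dy \le \frac{1}{|I|} \int_I \sum_{2^{-i} > |I|} |h_i^{[1]}(y)|\, dy \le C \sum_{2^{-i} > |I|} (2^i |I|)\, M f_i(x) .$$

This is dominated by the right hand side of (16). By the diagonal
estimate, we have

$$\frac{1}{|I|} \int_I \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} h_i^{[2]}(y) \Big|^r \Big)^{1/r} dy \le C\, |I|^{-1/r}\, \Big\| \Big( \sum_i |f_i 1_{3I}|^r \Big)^{1/r} \Big\|_r ,$$

which is again bounded by the right hand side of (16). Finally, we have
by decay estimates for $\phi_i$ and embedding of $\ell^1$ into $\ell^r$,

$$\frac{1}{|I|} \int_I \sup_{(N_k)} \Big( \sum_k \Big| \sum_{N_{k-1} < i \le N_k} h_i^{[3]}(y) \Big|^r \Big)^{1/r} dy \le \frac{1}{|I|} \int_I \sum_{2^{-i} \le |I|} |h_i^{[3]}(y)|\, dy \le C \sum_{2^{-i} \le |I|} (2^i |I|)^{-\epsilon}\, M f_i(x) ,$$

which is again bounded by the right hand side of (16). This concludes
the proof of the super-diagonal case.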

3. Reduction of Theorem 1.2 to the case $t = rs/(r+s)$

If $\max(r,s) < 2$ we simply observe that (5) is monotone in $t$ to reduce
to the case $t = rs/(r+s)$. If $\max(r,s) = 2$ we need to show that the
relatively weaker estimate (6) for some $t$ implies the relatively stronger
estimate (5) for $t_0 > t$. This will be a bilinear variant of the argument
used by Bourgain [1].

We apply Marcinkiewicz interpolation in the following form. 

Proposition 3.1. Let $f$, $g$ denote measurable functions on measure
spaces $X$, $Y$ with values in Banach spaces $F$ and $G$ respectively. Suppose
we have a bi-quasi-sublinear operation $(f,g) \to T(f,g)$ producing
some measurable function $T(f,g)$ on a measure space $Z$. Here we mean
quasi-sublinear in each argument, which is

$$T(f + \tilde f, g) \le C\, \big( T(f,g) + T(\tilde f, g) \big)$$

and similarly in the second argument. If we have the weak type estimates

$$|\{ z : T(f,g)(z) > \lambda \}| \le C\, \big( \lambda^{-1} \|f\|_p \|g\|_q \big)^{pq/(p+q)}$$

for all corners $(p,q)$ of an axis parallel rectangle whose interior contains
the point $(p_0, q_0)$, then we have the strong type estimate

$$\|T(f,g)\|_{p_0 q_0/(p_0 + q_0)} \le C\, \|f\|_{p_0} \|g\|_{q_0} .$$

The proof of this proposition follows the standard Marcinkiewicz
argument, in the multilinear setting described in Janson [3]. One splits
both functions $f$ and $g$ according to small and large values (according to
some level $\lambda$ that is later integrated on) and estimates the four resulting
terms of $T(f,g)$ using the assumed estimates. While Janson requires
sublinearity in each argument, the adaption to quasi-sublinearity is not
difficult since we split $T(f,g)$ only into four terms for each level $\lambda$.

Fix $p, q, r, s, t$ as in the theorem and let $t_0 > t$. We shall work in
the setting of Theorem 1.3 and deduce (8) for $t_0$ from (9) for $t$. By
Proposition 3.1 with $F = \ell^r(\mathbb{Z})$ and $G = \ell^s(\mathbb{Z})$ it suffices to show that
for any $\lambda > 0$ and for any $f = (f_i)$, $g = (g_j)$ with normalization (7) we
have

(17)  $$\Big| \Big\{ x : \Big( \sum_k F(k)^{t_0} \Big)^{1/t_0} > \lambda \Big\} \Big| \le C \lambda^{-pq/(p+q)} ,$$

where

$$F(k) = \Big| \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j) \Big| .$$
Indeed, since the problem is invariant under dilation of the $x$ axis by
powers of 2, it suffices to prove (17) for $\lambda = 1$. Let

$$E = \{ x : \sup_k F(k) > 1 \} .$$

Then by (9) for the exponent $t$ and with $u = pq/(p+q)$ we have

$$|E| \le \int |\{ k : F(k) > 1 \}|^{u/t}\, dx \le C .$$

Considering level sets of $F(k)$, for $x \notin E$ we have

$$\sum_k F(k)^{t_0} \le C \sum_{n<0} 2^{n t_0}\, |\{ k : F(k) > 2^n \}| .$$

By Hölder together with a geometric sum if $u/t > 1$ and by monotonicity
of $\ell^v(\mathbb{Z})$ in $v$ if $u/t \le 1$ we obtain

$$\Big( \sum_k F(k)^{t_0} \Big)^{u/t} \le C \sum_{n<0} 2^{n(1-\epsilon) u t_0/t}\, |\{ k : F(k) > 2^n \}|^{u/t}$$

for some small $\epsilon > 0$. Using Chebysheff we have

$$\Big| \Big\{ x \notin E : \sum_k F(k)^{t_0} > 1 \Big\} \Big| \le \int_{\mathbb{R} \setminus E} \Big( \sum_k F(k)^{t_0} \Big)^{u/t}$$

$$\le C \sum_{n<0} 2^{n(1-\epsilon) u t_0/t} \int |\{ k : F(k) > 2^n \}|^{u/t} \le C \sum_{n<0} 2^{n(1-\epsilon) u t_0/t}\, 2^{-nu} .$$

In the last inequality we have applied (9). The right-hand-side is summable
for sufficiently small $\epsilon$, since $t_0 > t$. This proves (17) and completes
the reduction to the case $t = rs/(r+s)$ for Theorem 1.3. Adaptions
for the discrete and mixed cases are not difficult. We shall assume
$t = rs/(r+s)$ throughout the rest of this paper.
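The level set step can be checked numerically. The sketch below assumes a finite list of values $0 \le F(k) \le 1$, each either zero or larger than $2^{n_{\min}}$ for a cutoff $n_{\min}$ that stands in for the infinite sum over $n < 0$, and uses the explicit constant $C = 2^{t_0}$, which is one admissible choice and is not taken from the text. The name `layer_cake_sides` is hypothetical.

```python
from typing import Sequence, Tuple


def layer_cake_sides(
    F: Sequence[float], t0: float, n_min: int = -60
) -> Tuple[float, float]:
    """Return (sum_k F(k)^t0, 2^t0 * sum_{n_min <= n < 0} 2^(n t0) #{k : F(k) > 2^n}).

    Assumes 0 <= F(k) <= 1 and that every nonzero F(k) exceeds 2^n_min."""
    lhs = sum(x ** t0 for x in F)
    rhs = 0.0
    for n in range(n_min, 0):
        count = sum(1 for x in F if x > 2.0 ** n)   # size of the level set at height 2^n
        rhs += 2.0 ** (n * t0) * count
    return lhs, (2.0 ** t0) * rhs
```

Each $k$ with $2^n < F(k) \le 2^{n+1}$ contributes at most $2^{(n+1) t_0}$ to the left side and is counted at height $2^n$ on the right side, which is where the factor $2^{t_0}$ comes from.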

4. The endpoint at r = 1 or s = 1. 

If $r = s = 1$, hence $t = 1/2$, we have the following trivial observation
for $0 < p, q < \infty$, which holds for arbitrary sequences $(f_i)$ and $(g_j)$:

(18)  $$\Big\| \Big( \sum_k \Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big|^{1/2} \Big)^2 \Big\|_{pq/(p+q)}$$

$$\le \Big\| \Big( \sum_k \Big( \sum_{N_{k-1} < i \le N_k} |f_i| \Big)^{1/2} \Big( \sum_{N_{k-1} < j \le N_k} |g_j| \Big)^{1/2} \Big)^2 \Big\|_{pq/(p+q)}$$

$$\le \Big\| \Big( \sum_i |f_i| \Big) \Big( \sum_j |g_j| \Big) \Big\|_{pq/(p+q)} \le \|(f_i)\|_{p,1}\, \|(g_j)\|_{q,1} .$$

Consider just one of $r$, $s$ equal to 1; by symmetry of the argument
below we may assume $r = 1$. We then have the following result:

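Proposition 4.1. Let $0 < p < \infty$ and $1 < q < \infty$ and assume
$t = s/(1+s)$. Let $N_k$ be measurable functions, $(f_i)$ any sequence of
measurable functions and let $(g_j)$ be a Littlewood Paley family.
If $1 < s < 2$, then

(19)  $$\Big\| \Big( \sum_k \Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big|^t \Big)^{1/t} \Big\|_{pq/(p+q)} \le C\, \|(f_i)\|_{p,1}\, \|(g_j)\|_{q,s} .$$

If $s = 2$, then

(20)  $$\Big\| \lambda \Big( \sharp\Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big| > \lambda \Big\} \Big)^{1/t} \Big\|_{pq/(p+q)} \le C\, \|(f_i)\|_{p,1}\, \|(g_j)\|_{q,s} .$$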

Inequality (19) follows quickly from Theorem 1.1. Note that

$$\Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big| \le \Big( \sum_{N_{k-1} < i \le N_k} |f_i| \Big) \sup_{N_{k-1} < i \le N_k} \Big| \sum_{i < j \le N_k} g_j \Big| .$$

Then estimate the left-hand-side of (19) with Hölder's inequality by

$$\Big\| \sum_i |f_i| \Big\|_p\, \Big\| \Big( \sum_k \sup_{N_{k-1} < i \le N_k} \Big| \sum_{i < j \le N_k} g_j \Big|^s \Big)^{1/s} \Big\|_q .$$

Introducing for each $x$ a new sequence which consists of the sequence
$N_k(x)$ interlaced with extremal choices of $i$ where applicable, we can
estimate the second factor by Theorem 1.1 and conclude (19).
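The pointwise bound used at the start of this argument only involves one block of indices. The sketch below, assuming non-empty finite real sequences of equal length that represent the values $f_i(x)$, $g_j(x)$ inside a single block, compares the paraproduct sum with the product of the $\ell^1$ sum of $f$ and the largest tail sum of $g$. The function names are hypothetical.

```python
from typing import Sequence


def paraproduct_sum(f: Sequence[float], g: Sequence[float]) -> float:
    """sum over i < j of f_i * g_j within one block."""
    n = len(f)
    return sum(f[i] * g[j] for i in range(n) for j in range(i + 1, n))


def tail_bound(f: Sequence[float], g: Sequence[float]) -> float:
    """(sum_i |f_i|) * max_i |sum_{j > i} g_j| within one block."""
    tails = [abs(sum(g[i + 1:])) for i in range(len(g))]
    return sum(abs(x) for x in f) * max(tails)
```

The bound follows by writing the paraproduct as $\sum_i f_i \big( \sum_{j>i} g_j \big)$, which is why only partial sums of $g$ with a moving lower endpoint appear and why the sequence $N_k$ has to be interlaced with the extremal indices $i$.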

It remains to prove (20). It will suffice to prove the analogue
inequality with $g_j$ replaced by $\phi_j * g_j$ on the left-hand-side and $g_j$ not
necessarily bandlimited as in Theorem 1.3. We will use Proposition
3.1. Since the quantity

(21)  $$\lambda \Big( \sharp\Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} (f_i)(\phi_j * g_j) \Big| > \lambda \Big\} \Big)^{1/t}$$

is not quasi subadditive, we shall replace it by an equivalent quasi
subadditive quantity. For $\lambda > 0$ define $\rho_\lambda(x) = \min(|x|, \lambda)$ and note
that $\rho_\lambda(x - y)$ defines a metric distance between $x$ and $y$. Hence the
operation $T_\lambda$ with

(22)  $$T_\lambda(f,g) = \lambda^{-1} \Big( \sum_k \rho_\lambda\Big( \sum_{N_{k-1} < i < j \le N_k} (f_i)(\phi_j * g_j) \Big)^{2t} \Big)^{1/t}$$

is quasi subadditive. It dominates (21) and hence it suffices to show
(here $r = 1$)

$$\|T_\lambda(f,g)\|_{pq/(p+q)} \le C\, \|f\|_{p,r}\, \|g\|_{q,s} .$$
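Two elementary facts are used here: the truncated absolute value $\rho_\lambda$ satisfies the triangle inequality, and the expression in (22) dominates the jump counting quantity in (21). The sketch below checks both for a finite list `values` that stands for the block sums $\sum_{N_{k-1}<i<j\le N_k} (f_i)(\phi_j * g_j)$ at a fixed point; the list and the function names are hypothetical.

```python
from typing import Sequence


def rho(x: float, lam: float) -> float:
    """Truncated absolute value rho_lambda(x) = min(|x|, lambda)."""
    return min(abs(x), lam)


def jump_quantity(values: Sequence[float], lam: float, t: float) -> float:
    """lambda * (number of k with |values[k]| > lambda)^(1/t), as in (21)."""
    count = sum(1 for v in values if abs(v) > lam)
    return lam * count ** (1.0 / t)


def t_lambda(values: Sequence[float], lam: float, t: float) -> float:
    """lambda^(-1) * (sum_k rho_lambda(values[k])^(2t))^(1/t), as in (22)."""
    return sum(rho(v, lam) ** (2 * t) for v in values) ** (1.0 / t) / lam
```

Every $k$ counted in (21) contributes exactly $\lambda^{2t}$ to the sum in (22), which gives the domination; the small values that (21) ignores are what makes (22) behave subadditively.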

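By Proposition 3.1 it suffices to prove

(23)  $$|\{ x : T_\lambda(f,g) > \mu \}| \le C \mu^{-pq/(p+q)}\, \big( \|f\|_{p,r} \|g\|_{q,s} \big)^{pq/(p+q)}$$

with $C$ independent of $\lambda$ and $\mu$. Switching back to expression (21) we
will prove

(24)  $$\Big| \Big\{ x : \lambda\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} (f_i)(\phi_j * g_j) \Big| > \lambda \Big\} \Big|^{1/t} > \mu \Big\} \Big| \le C \mu^{-pq/(p+q)}\, \big( \|f\|_{p,r} \|g\|_{q,s} \big)^{pq/(p+q)} .$$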
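We argue that this is sufficient to conclude (23). Consider for fixed $x$
the level sets of the function

$$F(k) = \Big| \sum_{N_{k-1} < i < j \le N_k} (f_i)(\phi_j * g_j) \Big| ,$$

i.e. the sets of $k$ for which $2^n \lambda < F(k) \le 2^{n+1} \lambda$ for $n < 0$. Using
geometric decay of $2^{\epsilon n}$ for some small $\epsilon > 0$ we obtain

$$T_\lambda(f,g) \le C \sum_{n \le 0} 2^{(2-\epsilon) n}\, \lambda\, |\{ k : F(k) > 2^n \lambda \}|^{1/t} .$$

Then we can estimate

$$|\{ x : T_\lambda(f,g) > \mu \}| \le \sum_{n \le 0} \Big| \Big\{ x : 2^{(2-\epsilon) n}\, \lambda\, |\{ k : F(k) > 2^n \lambda \}|^{1/t} > c\, 2^{n\epsilon} \mu \Big\} \Big|$$

$$\le C \sum_{n \le 0} \big( 2^{(2\epsilon - 1) n} \mu \big)^{-pq/(p+q)}\, \big( \|f\|_{p,r} \|g\|_{q,s} \big)^{pq/(p+q)} .$$

Taking $\epsilon < 1/2$ we obtain (23).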
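Now we prove (24). By dilating the $x$-axis, we may assume that
the right-hand-side of (24) is $C$. By multiplying $f$, $g$, $\lambda$, and $\mu$ by
appropriate constants we may assume the normalization (7) and by
the previous also $\mu = 1$. We write

$$\Big| \Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} (f_i)(\phi_j * g_j) \Big| > \lambda \Big\} \Big| \le \Big| \Big\{ k : \sum_{N_{k-1} < i \le N_k} |f_i| > \lambda^t \Big\} \Big| + \Big| \Big\{ k : \sup_{N_{k-1} < i \le N_k} \Big| \sum_{i < j \le N_k} \phi_j * g_j \Big| > \lambda^{t/s} \Big\} \Big| .$$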
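Therefore the left-hand side of (24) has the upper bound

$$\Big| \Big\{ x : \lambda\, \Big| \Big\{ k : \sum_{N_{k-1} < i \le N_k} |f_i| > \lambda^t \Big\} \Big|^{1/t} > c \Big\} \Big| + \Big| \Big\{ x : \lambda\, \Big| \Big\{ k : \sup_{N_{k-1} < i \le N_k} \Big| \sum_{i < j \le N_k} \phi_j * g_j \Big| > \lambda^{t/s} \Big\} \Big|^{1/t} > c \Big\} \Big| .$$

Using Chebysheff twice, the first term can be estimated by

$$\Big| \Big\{ x : \sum_i |f_i| > c^t \Big\} \Big| \le C\, \|(f_i)\|^p_{p,1} .$$

The second term can be estimated by

$$C\, \Big\| \lambda^{t/s}\, \Big| \Big\{ k : \sup_{N_{k-1} < i \le N_k} \Big| \sum_{i < j \le N_k} \phi_j * g_j \Big| > \lambda^{t/s} \Big\} \Big|^{1/s} \Big\|_q^q \le C$$

by Theorem 1.1 applied for a sequence $N_k$ interlaced with elements $i$.
This proves (24) and completes the proof of Proposition 4.1.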

5. The diagonal case of Theorem 1.2 and $L^2$ theory

The goal of this section is to prove Theorems 1.2 and 1.3 for one
pair of exponents $p, q$ for any given triple $r, s, t$ with $t = rs/(r+s)$. In
Section 6 we will raise and lower $p, q$ using a bilinear variant of the
argument in Section 2 and thereby complete the proof of Theorem 1.2.
As in the previous section, the desired estimates are technically easier
under some diagonal assumptions on $p, q$. We will assume throughout
this section $p = r$ and $q = s$ and in particular $t = pq/(p+q)$, which
allow us to turn (9) into a weak type formulation analogous to (13),
namely the estimate (formulation as in Theorem 1.3)

(25)  $$\int \sharp\Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j) \Big| > \lambda \Big\}\, dx \le C \lambda^{-t}$$

with the normalization assumption (7).

Note that for $r = 1$ or $s = 1$ we have already proven (25) in the
previous section for many values of $p, q$. By interpolation it suffices
to prove (25) for $r = s = 2$. Proposition 3.1 applied to the square
with corners $(2,2)$, $(2,1)$, $(1,2)$, $(1,1)$ will prove the diagonal case of
(8), hence (25), in the open interior of the square. Note that we apply
Proposition 3.1 with $F, G = \mathbb{C}$ and $X, Y = \mathbb{R} \times \mathbb{Z}$.

On the edges of the square we only need to prove the weak type
estimate (25), which does not require Marcinkiewicz interpolation but
simple interpolation by one time truncation of $f$ or $g$ and applying the
endpoint estimates.

The rest of this section will be concerned with proving the case
$r = s = p = q = 2$ and $t = 1$ of (25).

5.1. Discrete case, stopping times. As first step, we assume here
that $(N_k)$ is a stopping time, which means that if $I$ is the dyadic interval
of length $2^{-N_k(x)}$ containing $x$, then for all $y \in I$ we have $N_k(x) = N_k(y)$.
Such an interval $I$ is called a tree top and we can form the tree
$T$ (or stopping time region) consisting of all dyadic intervals contained
in $I$ but not contained in any smaller tree top.

For $x$ in a tree top $I$ of length $|I| = 2^{-N_{k-1}(x)}$ we can write

(26)  $$\sum_{N_{k-1} < i < j \le N_k} f_i(x)\, g_j(x) = \sum_{i<j} \Delta_i \Pi_T f(x)\, \Delta_j \Pi_T g(x)$$

where $f = \sum_i f_i$, $g = \sum_j g_j$ and $\Pi_T$ denotes the projection onto the
space spanned by the Haar functions associated to dyadic intervals in
the tree $T$. Here we have used that $\Delta_i f(x)$ equals the evaluation of the
projection of $f$ onto a single Haar function depending on $i$ and $x$ and
for a stopping time $(N_k)$ this Haar function corresponds to an interval
in the tree $T$ if and only if $N_{k-1} < i \le N_k$.

In the case of stopping times, we prove strong type (5) directly, which
of course implies weak type (25). Inserting (26) on the left-hand-side
of (5) and interchanging the order of sum and integration we obtain

$$\sum_{T \in \mathbf{T}} \int \Big| \sum_{i<j} \Delta_i \Pi_T f(x)\, \Delta_j \Pi_T g(x) \Big|\, dx .$$

By the paraproduct estimate (10) this is bounded by a constant times

$$\sum_{T \in \mathbf{T}} \|\Pi_T f\|_2\, \|\Pi_T g\|_2 .$$

Applying Cauchy Schwarz in the sum in $T$ and using orthogonality of
the projections $\Pi_T$ this implies (5).

5.2. Discrete case, arbitrary $(N_k)$. We will prove weak type estimate
(25) for arbitrary $(N_k)$ by comparing the set of $\lambda$ jumps for $(N_k)$
to the set of $\lambda/4$ jumps of an adapted stopping time $(\tilde N_k)$. Fix
measurable functions $N_k$, discrete Littlewood Paley families $f_i$ and $g_j$, and
$\lambda > 0$. For each $x$ choose $\tilde N_0(x)$ sufficiently close to $-\infty$ that the
maximal paraproduct restricted to $i < j \le \tilde N_0(x)$ is pointwise bounded by
$\lambda/4$, and the tree tops for $\tilde N_0$ partition the real axis. Define recursively
$\tilde N_k(x)$ to be the first time past $\tilde N_{k-1}(x)$ such that one of the following
two conditions is satisfied:

$$\Big| \sum_{\tilde N_{k-1}(x) < i < j \le \tilde N_k(x)} f_i(x)\, g_j(x) \Big| > \lambda/4 ,$$

$$\sup_{\tilde N_{k-1}(x) < m \le \tilde N_k(x)} \Big| \sum_{\tilde N_{k-1}(x) < i \le m} f_i(x) \Big|\, \Big| \sum_{m < j \le \tilde N_k(x)} g_j(x) \Big| > \lambda/4 .$$

If no such $\tilde N_k(x)$ exists, we set $\tilde N_k(x) = \tilde N_{k-1}(x) + 1$. The established
estimate for stopping times gives good control for the first kind of
jumps. The integral over the number of the second kind of jumps is
controlled by

$$\int \sharp\Big\{ k : \sup_{\tilde N_{k-1}(x) < m \le \tilde N_k(x)} \Big| \sum_{\tilde N_{k-1}(x) < i \le m} \Delta_i f(x) \Big| > c \lambda^{1/2} \Big\}\, dx + \int \sharp\Big\{ k : \sup_{\tilde N_{k-1}(x) < m \le \tilde N_k(x)} \Big| \sum_{m < j \le \tilde N_k(x)} \Delta_j g(x) \Big| > c \lambda^{1/2} \Big\}\, dx ,$$

where each term can be controlled via (2) by $C \lambda^{-1}$.
It remains to show that for every instance

$$\Big| \sum_{N_{k-1}(x) < i < j \le N_k(x)} f_i(x)\, g_j(x) \Big| > \lambda$$

there is a $k'$ satisfying

(27)  $$N_{k-1}(x) < \tilde N_{k'}(x) \le N_k(x) .$$

Assume the contrary that no such $k'$ exists, then by choice of $\tilde N_0$ one
must have $\tilde N_0(x) \le N_{k-1}(x)$. Let $k'$ be the largest index such that
$\tilde N_{k'-1}(x) \le N_{k-1}(x)$. Then $\tilde N_{k'}(x) > N_k(x)$ and we have

$$\Big| \sum_{N_{k-1}(x) < i < j \le N_k(x)} f_i(x)\, g_j(x) \Big| \le \Big| \sum_{\tilde N_{k'-1}(x) < i < j \le N_k(x)} f_i(x)\, g_j(x) \Big| + \Big| \sum_{\tilde N_{k'-1}(x) < i < j \le N_{k-1}(x)} f_i(x)\, g_j(x) \Big| + \Big| \sum_{\tilde N_{k'-1}(x) < i \le N_{k-1}(x)} f_i(x) \sum_{N_{k-1}(x) < j \le N_k(x)} g_j(x) \Big| .$$

Each term on the right hand side is less than $\lambda/4$ by choice of $\tilde N_{k'}$, while
the left-hand side is larger than $\lambda$. This contradiction proves (27).
5.3. Mixed and continuous case. We compare the discrete case
with the mixed and continuous case of (25) using an inequality by
Jones, Seeger and Wright ([?]), namely that the square function

$$Sf(x) = \Big( \sum_i |E_i f - \phi_i * f|^2 \Big)^{1/2}$$

is bounded in $L^2$, where $E_i = \sum_{j \le i} \Delta_j$ and $\hat\phi$ is supported in $|\xi| \le 2^N$
and constant 1 on $|\xi| \le 2^{N-1}$.
We write the telescopic sum

$$\sum_{N_{k-1} < i < j \le N_k} (\Delta_i f)(\Delta_j g) = \sum_{N_{k-1} < j \le N_k} (E_{j-1} f - E_{N_{k-1}} f)(\Delta_j g)$$

and compare with

$$\sum_{N_{k-1} < j \le N_k} (f * \phi_{j-1} - f * \phi_{N_{k-1}})(\Delta_j g) .$$

The difference can be estimated with another telescopic sum by

(28)  $$\Big| \sum_{N_{k-1} < j \le N_k} (E_{j-1} f - f * \phi_{j-1})(\Delta_j g) \Big| + \big| (E_{N_{k-1}} f - f * \phi_{N_{k-1}})(E_{N_k} g - E_{N_{k-1}} g) \big| .$$

Considering the first term, we estimate

$$\lambda\, \sharp\Big\{ k : \Big| \sum_{N_{k-1} < j \le N_k} (E_{j-1} f - f * \phi_{j-1})(\Delta_j g) \Big| > \lambda \Big\} \le \sum_j \big| (E_{j-1} f - f * \phi_{j-1})(\Delta_j g) \big| .$$

The $L^{pq/(p+q)}$ norm of the right-hand-side can be estimated using Hölder's
inequality and the classical square function estimate together with the
square function estimate of Jones, Seeger and Wright. Turning to the
second term in (28) we have

$$\lambda\, \sharp\big\{ k : \big| (E_{N_{k-1}} f - f * \phi_{N_{k-1}})(E_{N_k} g - E_{N_{k-1}} g) \big| > \lambda \big\} \le \lambda\, \sharp\big\{ k : |E_{N_{k-1}} f - f * \phi_{N_{k-1}}| > \lambda^{1/2} \big\} + \lambda\, \sharp\big\{ k : |E_{N_k} g - E_{N_{k-1}} g| > \lambda^{1/2} \big\} .$$

Then we estimate the first term by the square function of Jones, Seeger and
Wright and the second term using Theorem 1.1. This proves (25) for
the mixed convolutional and discrete case.
Using a very similar argument one can obtain (25) for the mixed
discrete (in $f$) and convolutional (in $g$) case, and by combination one
can obtain the pure convolutional case.

6. Completion of the proof of Theorem 1.2

For each pair $r, s$ and $t = rs/(r+s)$ the previous section proves
Theorem 1.2 for the specific pair $(p,q) = (r,s)$. Beginning with this pair,
we will lower and raise the exponents $p$ and $q$ by Calderón-Zygmund
decomposition and sharp function techniques respectively, as in Section
2. This will complete the proof of the theorem.

We will consider operators $S(f,g)$ and have two different cases:

(1) The quasi-bi-sublinear case

$$S(f,g) = \Big( \sum_k \Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big|^t \Big)^{1/t}$$

for $t < 1$. This case is needed to prove (5) for $\max(r,s) < 2$.

(2) The equivalent-to-quasi-bi-sublinear case

$$S(f,g) = \lambda \Big( \Big| \Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} f_i g_j \Big| > \lambda \Big\} \Big| \Big)^{1/t}$$

for arbitrary $\lambda$ and $t \le 1$. This case is needed to prove (6) when
$\max(r,s) = 2$.

The desired estimate now becomes an $L^{pq/(p+q)}$ estimate for $S(f,g)$.
We will discuss the more difficult Case 2 in more detail; the other case
is similar but easier.

In this section we focus on the continuous case. The mixed or discrete
case follows essentially the same arguments, with possible simplifications
at some places. We shall not comment further on the mixed and
discrete case.

Lowering exponents. Fix $r, s$ and first consider Case 2. We work in
the setting of Theorem 1.3. Assuming (9) holds for specific exponents
$(p_0, q_0)$, we prove (9) for all $p, q$ with $1 < p \le p_0$, $1 < q \le q_0$. More
precisely, passing from $S(f,g)$ to the quasi bi-sublinear operation

$$T_\lambda(f,g) = \lambda^{-1} \Big( \sum_k \rho_\lambda\Big( \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j) \Big)^{2t} \Big)^{1/t}$$

similarly as in Section 4 it suffices to prove

$$\|T_\lambda(f,g)\|_{pq/(p+q)} \le C\, \|f\|_{p,r}\, \|g\|_{q,s} .$$

By Proposition 3.1 it suffices to prove for all $1 \le p \le p_0$, $1 \le q \le q_0$
and all $\mu > 0$ the weak type bound

$$|\{ x : T_\lambda(f,g) > \mu \}| \le C \mu^{-pq/(p+q)}\, \big( \|f\|_{p,r} \|g\|_{q,s} \big)^{pq/(p+q)} .$$

As in Section 4 it suffices to prove

$$\Big| \Big\{ x : \lambda\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j) \Big| > \lambda \Big\} \Big|^{1/t} > \mu \Big\} \Big| \le C \mu^{-pq/(p+q)}\, \big( \|f\|_{p,r} \|g\|_{q,s} \big)^{pq/(p+q)} .$$

By symmetry of the argument, it suffices to fix $q = q_0$ and vary $p$.
By simple interpolation it suffices to consider the extremal cases. The
endpoint $p = p_0$ is true by assumption, so we may assume $p = 1$. As
in Section 4 we may assume (7) and $\mu = 1$.

Split $f = (f_i)$ into a Calderón-Zygmund decomposition at level 1
as in Section 2. The good function is in $L^{p_0}$ and we can apply the
known estimate and Chebysheff. The bad function satisfies outside the
expanded bad set the estimate

(29)  $$\Big\| \sum_i |\phi_i * b_i| \Big\|_{L^1((\bigcup 3I)^c)} \le C\, \|f\|_{1,r} .$$

Applying Proposition 4.1 to the sequences $((\phi_i * b_i) 1_{(\bigcup 3I)^c})$ and $(g_j)$
we get the desired estimate for the bad function. This completes the
proof of the weak type estimate for $p = 1$.

In Case 1 we proceed similarly, except we work directly with the
quasi sublinear expression

$$T(f,g) = \Big( \sum_k \Big| \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j) \Big|^t \Big)^{1/t} .$$

Raising the exponents in Case 2. In conjunction with the previous
subsection, it suffices to prove (9) for every $p > r$ and $q > s$ satisfying
the "diagonal" condition

(30)  $$\frac{p}{r} = \frac{q}{s} ,$$

which includes pairs with arbitrarily large $p$ and $q$. Consider the function

$$T_\lambda(f,g)(x) = \sup_{(N_k)} \Big( \sum_k \rho_\lambda\Big( \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(x)\, (\phi_j * g_j)(x) \Big)^{2t} \Big)^{1/(2t)} .$$

Similarly as before, (9) follows from bounds

$$\big\| \lambda^{-1}\, T_\lambda(f,g)^2 \big\|_{pq/(p+q)} \le C$$

under the normalization assumption (7). These are $L^{2pq/(p+q)}$ bounds
for $T_\lambda(f,g)$ which we can prove by the corresponding bound for the
sharp function $T_\lambda(f,g)^\sharp$.

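The desired estimate now follows from bounds

(31)  $$\lambda^{-1} \big( T_\lambda(f,g)^\sharp \big)^2 \le C\, M_r\Big( \Big( \sum_i |f_i|^r \Big)^{1/r} \Big)\, M_s\Big( \Big( \sum_j |g_j|^s \Big)^{1/s} \Big) + C \sup_I \frac{1}{|I|^2} \Big\| \sup_{(N_k)} \lambda\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < i < j \le N_k} (\phi_i * f_i)(\phi_j * g_j - \tilde g_{j,I}) \Big| > \lambda \Big\} \Big|^{1/t} \Big\|_{L^{1/2}(I)}$$

pointwise at every $x$. Here we have defined the function $\tilde g_{j,I}$ with respect
to the interval $I$ as in Section 2.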

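We will first show how to estimate the right-hand-side of (31). The
first term satisfies the desired $L^{pq/(p+q)}$ bound by Hölder and the maximal
theorem. We estimate the second term pointwise by a constant
times

$$M_{1/2}\Big( \sup_{(N_k)} \lambda\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < i \le N_k} \phi_i * f_i \Big| > \lambda^{t/r} \Big\} \Big|^{1/t} \Big) + \sup_I \Big( \frac{1}{|I|} \int_I \sup_{(N_k)} \lambda^{1/2}\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < j \le N_k} (\phi_j * g_j - \tilde g_{j,I}) \Big| > \lambda^{t/s} \Big\} \Big|^{1/(2t)} \Big)^2 .$$

Since $pq/(p+q) > 1/2$, we can estimate the $L^{pq/(p+q)}$ norm of the first
term using the maximal theorem adapted to $M_{1/2}$, by

$$C\, \Big\| \sup_{(N_k)} \lambda\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < i \le N_k} \phi_i * f_i \Big| > \lambda^{t/r} \Big\} \Big|^{1/t} \Big\|_{pq/(p+q)} .$$

Observe that $\frac{pq}{p+q} \cdot \frac{r}{t} = p$ thanks to (30). So by (2) with $\lambda^{t/r}$ in place of
$\lambda$ we can estimate the last display by

$$C\, \Big\| \Big( \sum_i |f_i|^r \Big)^{1/r} \Big\|_p^{r/t} = C .$$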

To estimate the $\tilde g_{j,I}$ term, we decompose as in Section 2

$$\phi_j * g_j - \tilde g_{j,I} = k_j^{[1]} + k_j^{[2]} + k_j^{[3]} .$$

To estimate the contribution of $k_j^{[2]}$, we'll use a weak-type variant of
the argument in Section 2. First, by adapting the sequence $N_k$ one can
bound this contribution by


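$$\sup_I \frac{1}{|I|^2} \Big\| \sup_{(N_k)} \lambda^{t/s}\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < j \le N_k} k_j^{[2]} \Big| > \lambda^{t/s} \Big\} \Big|^{1/s} \Big\|^{s/t}_{L^{s/(2t)}(I)}$$

and since $t > 1/2$ we can use Hölder to increase $L^{s/(2t)}(I)$ to $L^s(I)$,

$$\le \sup_I |I|^{-1/t} \Big\| \sup_{(N_k)} \lambda^{t/s}\, \Big| \Big\{ k : \Big| \sum_{N_{k-1} < j \le N_k} \phi_j * (g_j 1_{3I}) \Big| > \lambda^{t/s} \Big\} \Big|^{1/s} \Big\|^{s/t}_{L^s(I)} .$$

The point is that now we can use the diagonal case of (2) (with $\lambda^{t/s}$ in
place of $\lambda$). This gives the following estimate for the last display:

$$\le C \sup_I |I|^{-1/t} \Big\| \Big( \sum_j |g_j 1_{3I}|^s \Big)^{1/s} \Big\|_s^{s/t} \le C \Big( M\Big( \sum_j |g_j|^s \Big) \Big)^{1/t} .$$

Since $\frac{pq}{t(p+q)} > 1$, we can estimate the $L^{pq/(p+q)}$ norm of the last
display using the maximal theorem, by

$$\le C\, \Big\| \sum_j |g_j|^s \Big\|^{1/t}_{pq/(t(p+q))} = C\, \|g\|^{s/t}_{q,s} = C .$$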



The arguments for estimating the contributions of $k_j^{[3]}$ and $k_j^{[1]}$ will be
similar to each other, and below we'll only show details for the latter.
The contribution of $k_j^{[1]}$ is estimated using decay in $j$ as in Section 2

by 



< 



1 I I u 1 1 v k u AT, . ^ A. 



fc [l]|sU/(2t) 



N k -i<j<N k 



< 



\s/(2t) 



Csup (^r l(J2\ k f ] \y m ) 2 ^ c *f (^/ (s 7 M * 

<CM 1/2 ((supM ft ) sA )(^) 

j 

which satisfies the desired LP9/(p+<?) bound by the maximal theorem and 
(|30|) . This concludes the bound of the right-hand-side of (l3Tj) . 
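For reference, the boundedness of $M_{1/2}$ used above can be sketched as follows, assuming the convention $M_{1/2}h := (M(|h|^{1/2}))^2$ with $M$ the Hardy-Littlewood maximal function (this definition is an assumption; it is not restated in this section):

```latex
% Assumed convention: M_{1/2} h := ( M(|h|^{1/2}) )^2.
% For u > 1/2 we have 2u > 1, so M is bounded on L^{2u}:
\[
  \| M_{1/2} h \|_{u}
  = \big\| M(|h|^{1/2}) \big\|_{2u}^{2}
  \le C_u \big\| |h|^{1/2} \big\|_{2u}^{2}
  = C_u \| h \|_{u}.
\]
```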

It remains to prove the bound (31) for the sharp function. Fix an
interval $I$ and define constant functions $\tilde f$ and $\tilde g$ as in Section 2. We'll
use the constant

7. \2t\ 



I 2< 



c := sup(J^ fi gj) 

k N k _i<i<j<N k 

for the sharp function estimate. By metric properties we obtain

\[
|T_\lambda(f,g) - c| \le T_\lambda(f-\tilde f, g-\tilde g) + T_\lambda(f-\tilde f, \tilde g) + T_\lambda(\tilde f, g-\tilde g) . \tag{32}
\]

Focusing on the first term in (32), we estimate the average

\[
\Big(\frac{1}{|I|}\int_I T_\lambda(f-\tilde f, g-\tilde g)(x)\,dx\Big)^{2}
\]

using the equivalent estimate with

\[
\frac{1}{|I|^{2}}\,\Big\|\sup_{\lambda}\lambda\,\big|\{k : |\sum_{N_{k-1}<i<j<N_k}(\phi_i * f_i - \tilde f_i)(\phi_j * g_j - \tilde g_j)| > \lambda\}\big|^{1/t}\Big\|_{L^{1/2}(I)} .
\]



We'll show that the contribution of $T_\lambda(f-\tilde f, g-\tilde g)$ is bounded by the
product of two maximal functions on the right-hand side of (31). We
split as in Section 2

\[
\phi_i * f_i - \tilde f_i = h_i^{[1]} + h_i^{[2]} + h_i^{[3]} ,
\]
\[
\phi_j * g_j - \tilde g_j = k_j^{[1]} + k_j^{[2]} + k_j^{[3]} .
\]

The estimate for the term involving $h_i^{[2]}$ and $k_j^{[2]}$ follows from Hölder
(to increase $L^{1/2}$ to $L^{1}$) and the known estimate for the diagonal case
$(p,q) = (r,s)$. For all other estimates we can use the 1-variation bound
of the error terms as in Section 2 in conjunction with the endpoint
estimates of Proposition 4.1. The key idea is to incorporate the restriction
to the interval $I$ into the error terms before applying this Proposition,
because we want to preserve locality for future estimates by maximal
functions. For instance, to estimate the combination $(h_i^{[2]}, k_j^{[3]})$ we first
equivalently replace $h_i^{[2]}$ by $h_i^{[2]} 1_I$, then increase $L^{1/2}$ to $L^{r/(r+1)}$ and
decrease $t$ to $r/(r+1)$, then estimate using Proposition 4.1 by

\[
\le C\,|I|^{-\frac{r+1}{r}}\,\|(f_i 1_{3I})\|_{r,r}\,\|(k_j^{[3]} 1_I)\|_{1,1}
\le C\,M_r\Big(\Big(\sum_i |f_i|^r\Big)^{1/r}\Big)\,\sup_j M g_j .
\]

Turning to the second term in (32), note that by support of $\tilde g_j$ we may
restrict attention to $j < -\log_2 |I|$. Thanks to the constraint $i < j$, only
the term $h_i^{[1]}$ in the splitting of the function $\phi_i * f_i - \tilde f_i$ then appears in
the summation. While $\tilde g_j$ itself does not have geometric decay in $j$, the
geometric decay of $h_i^{[1]}$ in $i$ is sufficient to obtain geometric summability
in both $i$ and $j$ thanks to the constraint $i < j$. Specifically, using
$t > 1/2$ and Hölder,



1 ' Jl ' ' Jl i<J<-log 2 |/| 

- C {\T\j I { £ (m)^Mf t (y)infMg 3 (y) 



i<j<-log 2 |7| 

< C sup Mfi(x) sup Mgj(x) 
* j 
This proves the desired estimate for the second term. Combining the
first and second terms in (32) we also obtain bounds for the variant of
the second term with $\tilde g$ replaced by $g$.

Turning to the last term in (32), we may also replace $\tilde f$ with $f$. To
apply the same argument as for the second term, we need to replace
the constraint $i < j$ by $i > j$, which can be done via the triangle inequality
and estimates on the unconstrained expression



\[
\frac{1}{|I|^{2}}\,\Big\|\sup_{\lambda}\lambda\,\big|\{k : |\sum_{N_{k-1}<i,j<N_k}(\phi_i * f_i)(\phi_j * g_j - \tilde g_j)| > \lambda\}\big|^{1/t}\Big\|_{L^{1/2}(I)} .
\]

This term can be estimated by the second term on the right-hand side
of (31). This completes the proof of the pointwise bound for the sharp
function.

Raising exponents for Case 1. The argument in this section is
similar to, and simpler than, that of the last section, and we'll only comment on
the necessary changes. Consider a pair $(p,q)$ such that $r < p < \infty$,
$s < q < \infty$. To show the desired $L^{pq/(p+q)}$ bound for

\[
S(f,g) = \Big(\sup_{N_k}\sum_k \big|\sum_{N_{k-1}<i<j<N_k}(\phi_i * f_i)(\phi_j * g_j)\big|^{t}\Big)^{1/t}
\]

we'll show the corresponding bound for $T(f,g)^{\sharp}$, where $T(f,g) = S(f,g)^{1/2}$.
Below we'll show the pointwise bound:

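\[
T(f,g)^{\sharp} \le C\Big(M_r\Big(\Big(\sum_i |f_i|^r\Big)^{1/r}\Big)\,M_s\Big(\Big(\sum_j |g_j|^s\Big)^{1/s}\Big)\Big)^{1/2}
+ C\Big(M\big(\|\phi_i * f_i\|_{V^r}\big)\,M_s\Big(\Big(\sum_j |g_j|^s\Big)^{1/s}\Big)\Big)^{1/2} . \tag{33}
\]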

Using this inequality, the desired $L^{2pq/(p+q)}$ bound for $T(f,g)^{\sharp}$ follows from
Hölder's inequality, the maximal theorem and Theorem 1.1.

Below we show (33). Fix any interval $I$ and define $\tilde f$ and $\tilde g$ as in
Section 2. We'll use the following constant for the sharp function estimate:

\[
c := \Big(\sup_{N_k}\sum_k \big|\sum_{N_{k-1}<i<j<N_k}\tilde f_i\,\tilde g_j\big|^{t}\Big)^{1/(2t)} .
\]

Since $\frac12 < t < 1$, both $|x|^{t}$ and $|x|^{1/(2t)}$ define a norm, so $T(f,g)$ is
sublinear. We actually do not need $t < 1$ for this to be true, because
when $t > 1$ we could combine the $|x|^{1/2}$ and $\ell^{t}(\mathbb{Z})$ norms. We then have

\[
|T(f,g) - c| \le T(f-\tilde f, g-\tilde g) + T(f-\tilde f, \tilde g) + T(\tilde f, g-\tilde g) .
\]
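The sublinearity rests on the subadditivity of power functions with exponent at most one. A short sketch of that fact; the exponent $\alpha$ is a placeholder, to be taken from the interval $(1/2,1)$ when $1/2 < t < 1$:

```latex
% For 0 < \alpha \le 1 and x, y \ge 0 with x + y > 0, both ratios
% x/(x+y) and y/(x+y) lie in [0,1], hence are at most their \alpha-th powers:
\[
  1 = \frac{x}{x+y} + \frac{y}{x+y}
  \le \Big(\frac{x}{x+y}\Big)^{\alpha} + \Big(\frac{y}{x+y}\Big)^{\alpha},
  \qquad\text{so}\qquad
  (x+y)^{\alpha} \le x^{\alpha} + y^{\alpha}.
\]
% When 1/2 < t < 1, both \alpha = t and \alpha = 1/(2t) lie in (1/2, 1).
```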






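The contributions of $T(f-\tilde f, g-\tilde g)$ and of $T(f-\tilde f, \tilde g)$ can be
estimated as in the consideration of Case 2 in the last section. To estimate
the contribution of $T(\tilde f, g-\tilde g)$, as before it suffices to estimate the
unconstrained expression:

\[
\frac{1}{|I|}\int_I \sup_{N_k}\Big(\sum_k \big|\sum_{N_{k-1}<i,j<N_k}\tilde f_i\,(\phi_j * g_j - \tilde g_j)\big|^{t}\Big)^{1/(2t)} .
\]

By Hölder's inequality we can estimate this by

\[
\le \sup_{N_k}\Big(\sum_k \big|\sum_{N_{k-1}<i<N_k}\tilde f_i\big|^{r}\Big)^{1/(2r)}\;
\frac{1}{|I|}\int_I \sup_{N_k}\Big(\sum_k \big|\sum_{N_{k-1}<j<N_k}(\phi_j * g_j - \tilde g_j)\big|^{s}\Big)^{1/(2s)} .
\]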



Using the estimates in Section 2 we can estimate the last display by

\[
\le C\Big(\sup_{N_k}\Big(\sum_k \big|\sum_{N_{k-1}<i<N_k}\tilde f_i\big|^{r}\Big)^{1/r}\Big)^{1/2}
\Big(M_s\Big(\Big(\sum_j |g_j|^s\Big)^{1/s}\Big)\Big)^{1/2} .
\]



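Now, since $\tilde f_i = \frac{1}{|I|}\int_I \phi_i * f_i$ for $i < -\log_2 |I|$ and $\tilde f_i = 0$ otherwise, by
Minkowski's inequality we can estimate the last display by

\[
\le C\Big(\frac{1}{|I|}\int_I \sup_{N_k}\Big(\sum_k \big|\sum_{N_{k-1}<i<N_k}\phi_i * f_i\big|^{r}\Big)^{1/r}\Big)^{1/2}
\Big(M_s\Big(\Big(\sum_j |g_j|^s\Big)^{1/s}\Big)\Big)^{1/2}
\]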
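The Hölder step used earlier in this argument is the sequence-space inequality below. It is a sketch under the assumption that $1/t = 1/r + 1/s$; $a_k$ and $b_k$ are placeholder names for the two inner sums.

```latex
% Assumption: 1/t = 1/r + 1/s, so r/t and s/t are conjugate exponents.
\[
  \sum_k |a_k b_k|^{t}
  \le \Big(\sum_k |a_k|^{r}\Big)^{t/r}\Big(\sum_k |b_k|^{s}\Big)^{t/s},
\]
% and taking the power 1/(2t), as appropriate for T = S^{1/2}:
\[
  \Big(\sum_k |a_k b_k|^{t}\Big)^{1/(2t)}
  \le \Big(\sum_k |a_k|^{r}\Big)^{1/(2r)}\Big(\sum_k |b_k|^{s}\Big)^{1/(2s)}.
\]
```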